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Abstract 

A coarse-grained variational model is used to investigate the polymer dynamics of barrier crossing 
for a diverse set of two-state folding proteins. The model gives reliable folding rate predictions 
provided excluded volume terms that induce minor structural cooperativity are included in the 
interaction potential. In general, the cooperative folding routes have sharper interfaces between 
folded and unfolded regions of the folding nucleus and higher free energy barriers. The calculated 
free energy barriers are strongly correlated with native topology as characterized by contact order. 
Increasing the rigidity of the folding nucleus changes the local structure of the transition state 
ensemble non-uniformly across the set of protein studied. Neverthless, the calculated prefactors ko 
are found to be relatively uniform across the protein set, with variation in 1/ko less than a factor of 
five. This direct calculation justifies the common assumption that the prefactor is roughly the same 
for all small two-state folding proteins. Using the barrier heights obtained from the model and the 
best fit monomer relaxation time 30ns, we find that 1/ko ~ 1 — 5/is (with average 1/kO ~ 4/xs). 
This model can be extended to study subtle aspects of folding such as the variation of the folding 
rate with stability or solvent viscosity, and the onset of downhill folding. 
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Folding in small proteins is often well characterized as a cooperative transition between 
two well-defined structural populations: an unstructured globule ensemble and a structured 
folded ensemble. The transition rate between free energy minima is controlled by the dy- 
namics of passing through an unstable transition region determined by saddlepoints in the 
free energy surface. Accordingly, the rate is expected to follow Arrhenius form 

k f = k e-^ F \ (1) 

where (3 = 1/k-oT is the inverse temperature and AF^ is the free energy difference between 
the unfolded and transition state ensembles. The exponential factor in Eq.flTJ reflects the 
equilibrium population of the transition state ensemble relative to unfolded ensemble and 
the prefactor, ko, is the timescale associated with the dynamics of crossing the free energy 
barrier. Successful identification of specific residues structured in the transition state en- 
semble by several different theoretical models ±2iiL£i!£&iL!L and numerous simulation studies 
(ref.9,10,11 and references therein) has established that the topology of the native structure 
determines the folding mechanism of these proteins. In addition, two-state folding rates are 
well correlated with very simple measures of the native state topology such as contact or- 
der— dAM.dk. While additive potentials often produce reasonable structural characterization 
of the transition state ensemble, the range of simulated folding rates and their relationship 
with contact order does not agree with experiment^. In this paper we present direct folding 
rate calculations that capture the trends first noted in lattice models^ 1 ^ and very recently 
in continuum models^ 1 ^ leading several groups to speculate that the behavior of folding 
rates indicate enhanced structural cooperativity. 

The term "structural cooperativity" usually refers to a mechanism by which the presence 
of a structured region makes additional order more favorable. For example, cooperativity is 
greater when a contact between two residues is more stabilized after one of the partners is 
already ordered. The additional stability is most naturally introduced through local attrac- 
tive multibody interactions associated with coarse-grained potentials^^^i. Non-additive 
potentials can also be neutral with respect to disordered and ordered residues and still in- 
crease cooperativity. For example, destabilizing partially ordered residues near structured 
resides generically sharpen the interface between folded and unfolded regions by increasing 
the surface energy of the folding nucleus^ 1 ^. Even purely repulsive interactions can enhance 
cooperativity. A good example of this is the "induced rigidity" (enhanced helical order) due 
to liquid crystal ordering in dense polymer solutions^ 1 ^ and globular helical proteins^. 

In the present analytic model, cooperativity is introduced through repulsive, excluded 
volume interactions between residues in proximity to native contact pairs. This potential is 
effectively "neutral" since it primarily destabilizes partially ordered residues at the interface 
of the folding nucleus. The cooperative term of the potential is pairwise additive in the 
space of all contacts, but corresponds to an effective multi-body potential when projected 
onto the set of native contacts. The particular form of cooperativity was developed so that 
the calculated barrier heights remain robust with respect to variations of excluded volume 
strength in the original variational model^. 

Experimental evidence supporting a specific decomposition of the folding rate into the 
dynamic and thermodynamic factors in Eq. [T]is necessarily indirect^. While structural pre- 
dictions from models with a strong native state bias (Go-models) are robust, the value of 
the barrier height (and corresponding absolute timescale 1/fco) is more sensitive to details 
of the model^ 1 ^. The predicted prefactor is commonly assumed to be roughly uniform for 
different proteins with a magnitude of O(0.1-l/is -1 )— >2iy&2i though prefactors as large as 
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O(100/is _1 ) have also been proposed recently^ 1 ^. While the precise value of the prefactor 
is a sub-dominant determinant of the absolute rate, accurate estimation gives an important 
reference timescale essential, for example, to identify the the fastest measured rates as down- 
hill (or barrier-less) folding^^^. Calculations for 28 two state proteins presented in this 
paper predicts the prefactor is relatively uniform on the order 0(l/is _1 ), largely indepen- 
dent of differences in the absolute folding rates or the native state topology. Furthermore, 
predicted folding rates agree with experimental trends provided interaction terms favoring 
modest structural cooperativity are included in the model. In particular, the relationship 
between barrier heights and contact order is found to be a consequence of relatively rigid 
folding nuclei. 

Model: excluded volume and cooperativity 

The variational model developed by Portman, Takada, and Wolynes^ has proved reliable 
in predicting the structure of the transition state ensemble of individual proteins at the 
residue level of resolution^L^. In this model, the free energy of partially ordered ensembles 
of polymer configurations is developed through a reference Gaussian chain inhomogeniously 
constrained to the native positions by N harmonic variational constraints {C}. A summary 
of the variational model is given in the Supplemental Material. Here, we focus on how 
enhanced cooperativity can be realized by the addition of repulsive interactions between 
non-native contacts. 

We divide the energy into two contributions 

^K^}] = E^ nat (^))o+ E < mCOOP m>o (2) 

NAT NON-NAT 

where the subscript indicates an average over the reference Hamiltonian. The first term rep- 
resents attractive interactions between monomers that are neighbors in the native structure 
(i.e., the Go-model assumption). The second term, which is new to the model, represents 
excluded volume interactions between non-native contact pairs. Before explaining the con- 
sequences of these repulsive interactions, we first motivate the need for this contribution by 
considering the native contact potential in the original model. 

The form of the interactions between native contacts is the sum of three Gaussians for 
convenience: tt nat (r.y) = e^- ^a=i i s laG~ aaVij where is the strength of the interactional. 
Repulsive intermediate- and attractive long-range Gaussians sum to give a potential well with 
minimum w nat (r m i n ) = —1 at the distance r m i n = 6A. The short-range Gaussian represents 
excluded volume for native contact pairs; we choose 7; for each contact to give the same 
strength at zero distance, £7(0). The finite strength of the repulsion at r = is an artifact 
of the potential (and the finite native monomer density at short range), so there is some 
ambiguity in determining the appropriate value for £7(0). This is troubling since it was 
found that the calculated barrier height is sensitive to the value of £7(0), even though the 
structure of the transition state ensemble is relatively robust for most proteins. 

The sensitivity of AF^/k^Tf on the excluded volume strength £7(0) indicates that the 
cooperativity in the model is relatively low. This can be understood by considering the 
short-distance pair density of a partially ordered chain, n^r) = (5(r — ry))o. Integrating 
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over angles gives the radial pair density 
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where the correlations Gij = (Sr^ ■ 5rj) /a 2 and 6Gij = Gu + Gjj — 2Gij is the magnitude of 
the fluctuations about the relative mean separation s$j = YK^ij ~ Gjk)Ckr^;a,nd a — 3.8Ais 
the distance between adjacent monomers. The weight at short distances (r < r , r is 
excluded volume interaction length scale) is small when the pair is sufficiently delocalized 
(r <C a^5Gij) or sufficiently localized (a^SGij <C Sy). Consequently the sensitivity of 
the barrier height on the short distance repulsion is due to the partially ordered residues 
in the transition state ensemble. Increasing the cooperativity by destabilizing partially 
ordered residues makes the barrier height less sensitive to £7(0). To this end, we modify 
the interactions between native contacts through a repulsive potential between residues 
in close proximity to native contacts u coop (rij)/eo = U(0) exp [— a s r^-] . This term increases 
cooperativity of partially structured ensembles by encouraging residues surrounding a native 
ordered pair to either be delocalized or ordered themselves to reduce short-distance density 
overlap. 

When viewed as an effective potential involving only native contact pairs, the repulsion 
between non-native contacts effectively induces local multi-bodied interactions. Due to 
chain connectivity, structured regions that are sufficiently non-local in sequence have greater 
cooperativity (see Fig. [1]). Accordingly, we define a reduced set of non-native contacts: for 
every native pair with \i — j\ > 12 we include pairs within a window [i ± 4, j ± 4] and 
eliminate duplicates or native contact pairs from the sum. With this convention, the barrier 
height varies less than about l-2kv,T{, over a wide range of U(0) (5 < U(0) < 60). In the 
following we take U(0) = 50. 

The parameters of the model are the same as given in^, except: (i) the magnitude U(0) 
is fixed for each contact; (ii) cooperativity term; (iii) the radius of gyration of the globule 
is set by the chain length according to the scaling law given in^. We note that with 
these parameters, the folding route for A-repressor studied in^, is structurally similar the 
cooperative folding route, though the barrier is approximately two times larger. 



Results 

Folding rates and prefactors. We calculated the prefactors and folding routes of 
28 two state folding proteins. The corresponding folding rates at the transition midpoint, 
kf = k e~ AF ^ kBTt , are shown in Fig. [2j Calculated rates in absolute units depend on 
the timescale set by the monomer relaxation rate a = 3Do/a 2 which we take as a fitting 
parameter. As shown in Fig. [21 the predicted and measured rates are well correlated (r = 0.8) 
with agreement within an order of magnitude for 80% of the proteins. 

The best fit monomer relaxation time 1/er = 30ns is on the order of the timescale of 
unfolding a helical segment^. With this microscopic timescale, the longest relaxation time 
of a chain of 100 monomers is approximately ~ O(10/xs) which compares well^ with 
the timescale for the fastest collapse kinetics measured in proteins and polypeptides^ 1 ^ 2 - 1 ^. 
On the other hand, \ja is an order of magnitude slower than estimates obtained from an 
effective diffusion coefficient inferred from loop closure experiments of small peptides (~ Ins) 
and two orders of magnitude slower than estimates from bare diffusion coefficients of the 
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monomer (~ 100 ps) 44 . The source of small effective diffusion coefficients associated with 
simple Gaussian models is not fully understood 4 ^^. Nevertheless, results from recent 
experiments on small peptides under different solvent conditions indicate that intrachain 
interactions (that can be interpreted kinetically as a kind of internal friction) induce local 
activation barriers that renormalize the effective monomer diffusion coefficient^^. Although 
controversial, internal friction may explain why the speed limit of protein folding is fixed at 
~ 0(0.1-1. 0/is)^. 

Even though the prefactor of each protein is calculated individually, its value in absolute 
units ultimately depend on the calculated barrier heights through the microscopic timescale 
a. Relative prefactors, on the other hand, are independent of this fitting parameter. As 
shown in Fig. [3], the distribution of prefactors is relatively uniform, varying within a factor 
of 5 for most proteins. Using the fitted value for a, r = 1/fco varies mainly between 1/is 
to 5/xs with an average f = 4/is. Given this narrow distribution, it is not surprising that a 
uniform prefactor of k$ = (4/is) -1 gives essentially the same correlation to the measured and 
predicted rates (data not shown). Thus, direct calculation of the barrier crossing dynamics 
gives solid evidence supporting the common assumption that the folding rate prefactor is 
largely independent of topology. Recent work by Henry and Eaton also suggests the prefac- 
tor is relatively uniform across two-state folding proteins based on analysis of folding rates 
from a different set of analytic models 3 -. The value for the average prefactor ko ~ 10 5 s _1 
agrees within an order of magnitude with estimates based on semi-empirical and theoretical 
models^ 1 - 1 ^ as well as analysis of thermodynamic data from differential scanning calorime- 
trj*£. This value also is consistent with the fastest measured rates ~ 1/xs, if the timescale 
for downhill folding is approximated by the Arrhenius rate with a vanishing barrier—^. 

A closer look at the two proteins (limb and lpks) with exceptionally small calculated 
prefactors reveals that in each case the unstable mode becomes degenerate at a stability near 
the transition midpoint. The structure of the transition ensemble changes sharply, though 
continuously, as the temperature crosses the degenerate point. In particular, the curvature 
of the unstable mode (and consequently the calculated prefactor) sharply vanishes in a cusp 
catastrophe^ 4 -. Away from these isolated temperatures, the prefactors return to the range 
exhibited by the majority of the proteins studied. Several of the proteins studied have similar 
rapid changes of the transition state as a function of temperature, occurring at temperature 
sufficiently far away from the midpoint so that the prefactor is relatively unaffected near Tf . 
In this high dimensional model, catastrophes can be generally expected as local minima and 
saddle-points merge at isolated values of the control parameters (e.g., temperature). The 
shape of the calculated prefactor versus temperature is thus determined by these degenerate 
points. For example, even for a route with a single transition state, the meta-stable unfolded 
or folded minimum disappear in a fold catastrophe at the limit of stability (spinodal) for both 
low and high temperatures^ 1 ^. If there are no other catastrophes, the calculated prefactor 
obtains a maximum at an intermediate temperature and vanishes at the spinodals. Near the 
maximum the prefactor varies much more slowly with temperature than near the spinodal. 
This generic shape of the prefactor is interesting since it can account qualitatively for non- 
linear dependence of the rate with stability (chevron turnovers) and may indicate kinetic 
signatures anticipating the onset of downhill folding. 

Nevertheless, interpreting these results requires some care. The harmonic expansion 
of the free energy is not expected to accurately reflect the global curvature of the free 
energy over ~ F* ± k^T when the local curvature is very small. For these cases, it is likely 
that the formalism should be modified away from strictly local curvatures to get accurate 
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estimates of the prefactor. Even if the renormalized prefactor is found to be relatively 
constant, the rapid change of the order parameter at the transition state that accompanies 
a catastrophe may itself account for cheveron roll-over, similar to the transition state 
switching mechanism suggested by Oliveberg^. The subtle variation of the prefactor and 
free energy barrier height with stability is an important issue that has yet to be thoroughly 
explored. 

Free energy profiles and folding routes. In the context of identifying kinetic trends 
for all two state proteins, a complete theory of the folding mechanism must reliably predict 
structural properties of the transition state ensemble in addition to absolute folding rates. 
The formation of local order along the folding route can be characterized by the degree of 
localization about the native positions 

Pi [{C}\ = <exp [-3a N (r, -rf) 2 /2a 2 ]> . (4) 

We refer to as the native density. Comparing the folding profiles and structural localization 
of the residues shown in Fig. 0] illustrates the cooperative nature of the folding routes induced 
by the repulsive non-native interactions. While the coarse grained structures of the transition 
state ensembles are similar for this protein, the residues order more gradually in the non- 
cooperative routes. Still, even for the cooperative route, the interface has a finite width as 
the structural ensembles retains some partial ordering of the residues. The sharper interface 
of the cooperative route is also accompanied by a significantly larger barrier. 

The effect of cooperativity on the structure of the transition state ensemble is complicated 
to describe in general. Cooperativity narrows the interface by destabilizing partially ordered 
residues in favor of either more ordered or more disordered. Whether a particular interfacial 
residue is excluded or incorporated into the folding nucleus is a subtle question, determined 
by the delicate balance between changes in entropy and energy due to localization. One 
measure to characterize changes in local structural order is the cross correlation 

^ Pcoop ' Pnon—coopi (5) 

where p coop or p non _ coop denote unit vectors with elements Pi for transition state ensem- 
bles with or without cooperativity, respectively. Fig. [5] shows the value Q for each protein as 
well as a typical example of the overlap of native densities evaluated at Q*. For 80% proteins 
studied, the overlap between the transition state ensemble structures is greater than 60%. 
Nevertheless, the variation of Q indicates this form of cooperativity does not effect every 
protein uniformly. 

Changes of the transition state ensemble can be also be characterized by the variation of 
the global order parameter AQ*. As shown in Fig. [5J the majority of the proteins studied 
have |AQ*| < 0.1. In terms of global order, the a-helical proteins are not very sensitive 
to cooperativity, though the local structure of the transition state ensemble can change 
significantly. For j3 and a/ (3 proteins, some systematic errors in the calculated barrier 
height can be associated with relatively large changes in the global order. For proteins with 
AQ* > 0.1 (lurn, lc8c, lpsf, lcsp), the model overestimates the barrier heights, while for 
proteins with AQ* < 0.1 (lpgb, laOn, lcoa, lshg) the model underestimates the barrier 
height. (See Fig. [2J ) This trend may be particular to form of cooperativity used in this 
model. 

Direct comparison between theoretical and measured 0-values shows that cooperative 
routes generally have significantly higher correlation with experiment. Following Garbuzyn- 
skiy et al. 27 , we make a distinction between contact maps obtained from native structures 
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determined by x-ray crystallography and those from the first model of an NMR structure 
or minimized averaged NMR structure. Overall, the theory predicts values for studied 
x-ray structures reasonably well. Still, there are exceptions. Of the 11 x-ray structures (see 
caption in Fig. [2]), two proteins( lshg, lten) have large negative correlations. The average 
correlation coefficient for nine remaining proteins increases from 0.33 (for non-cooperative 
routes) to 0.6 (cooperative routes). Predictions of 0- values for NMR determined structures 
is significantly worse with the average 0.1 for both non-cooperative routes and cooperative 
routes. In Fig. [5] , we give three examples for x-ray structures and NMR structures 
respectively. 

Folding barriers and absolute contact order. Since the prefactors are relatively 
uniform, the wide variation of relative folding rates is determined by differences in free 
energy barriers. To investigate the relationship between barrier heights and native topology, 
we consider the correlation between the free energy barriers and the absolute contact order—. 
Fig. E] shows that the calculated barrier height is highly correlated (r = 0.9) with absolute 
contact order when the cooperativity term included in the model. The barrier heights 
calculated without cooperativity does not show significant correlation with absolute contact 
order. This indicates that the relationship between native topology and the folding rate is 
sensitive to the rigidity of the folding nucleus. This may in fact be a robust result, largely 
independent of the details of reasonable potentials that increase local cooperativity between 
native contacts 18 . 

Assuming the prefactor is roughly uniform, the range of measured rates for this protein set 
correspond a range of free energy barrier heights of about ~ lAk-^Tf, in agreement with the 
calculated barriers. In contrast, the range of barriers for the non-cooperative routes spans 
only ~ 5k-BT{. Interestingly, this is the same range determined through coarse-grained Go- 
model simulations^ 1 ^ 1 ^. Furthermore, the low correlation between contact order and barrier 
heights of non-cooperative routes is also reminiscent of results from Go-model simulations^. 
Together, these results suggest that the cooperativity of typical Go-model simulations based 
on two-body pair potential is too low 1 — iiLi&ii2 

I. CONCLUSION 

The repulsive potential between residues in proximity to native contacts is a convenient 
way to alleviate sensitivity on the excluded volume strength in the original model. This was 
successful because the potential enhances cooperativity of the model. Our point of view is 
that the nature of the interface of the folding nucleus is key in determining the behavior of 
folding rates and mechanisms, regardless of the specific form of cooperative interactions or 
the microscopic origins. If the qualitative results from this study can be extended beyond 
this variation model, it is likely to be limited to models that enhance cooperativity locally. 
Because these results are robust with respect to the excluded volume strength U(0), the 
model lacks flexibility to explore a wide range of surface tensions. It will be very interesting 
to see if these conclusions hold when the interfacial surface tension is controlled directly 
through, for example, the formalism of density functional theory of first order nucleation. 
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FIG. 1: Cooperativity of local and non-local contacts. (a)Localization of a short-ranged pair induces 
little cooperativity because the monomer density of the surrounding residues is not significantly 
altered by a partially formed local contact . (b)In contrast, if the contact pair is non-local sequence, 
this bring together larger regions residues (of the order of the persistence length) in proximity, in- 
creasing the cooperativity . The dependence of the cooperativity on sequence length is reminiscent 
of cooperative desolvation between folded segment o 6 ° l61 . 
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FIG. 2: (color) Comparison between experimental and calculated folding rates. The correla- 
tion coefficient is r = 0.8, with p-value p = 4.4 x 1CP 7 . (circle denotes a— protein, square 
denotes (5— protein, and diamond denotes a/?— protein.). Kinetic data for proteins lenh, lvii, 
lhdn were taken frorn^; kinetic data for the rest of proteins were taken fromi^.Here, x-ray 
structured proteins are: lpgb,lcoa,lcsp,lc8c,lpin, limb, lenh, lfkbjlurn^shgjlten^ldiv^lfnf 9 ,* 
lpgbl6; NMR structured proteins are: Io6x,limq,lsrl,2ptl,laps,2abd,*la0n,*2pdd,*lmef,*lpsf, 
*lwit, *lpks,* lvii,* lhdn. (Proteins not used in Rvalue analysis are indicated by * ) 
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FIG. 3: Histogram of the inverse folding prefactor tq = 1/ko for the 28 two-state proteins. Ordinate 
is number of proteins. 




FIG. 4: (color) Folding route for protein Src tyrosine kinase SH3 domain(PDB lsrl) characterized 
locally by the normalized native density pi = (p, — pi(G))/(pi(N) — Pi(G)) and the global progress 
coordinate Q = 1/-/V^ Pi-ln the center is a diagram of the protein secondary structure with five 
j3— strand f3— barrel ,three loops(RT,n-Src, distal), a diverging turn and a 3io helix, (a) Contact 
map: black squares are native contacts, and red squares are non-native pairs. The local and global 
structure along the folding route with cooperativity: (b) and without cooperativity (c).The degree 
of structural localization of each residue is reflected in the colors, linearly scaled from red (pi = 0) 
to blue (pi = 1). (d) Free energy profile as a function of Q: the red curve is folding route with 
cooperativity, and the black curve is without cooperativity. The circles denote the critical points 
defining the folding route, while the curves are steepest descents paths. Experimental ^-values 
suggest that the (52 — /33 — /34 are ordered in the transition state ensemble^ in agreement the 
structure predicted from the model. 
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FIG. 5: (color) Comparision of the structure of cooperative and non-cooperative transition state 
ensembles. (Center)The change in the local structure of the transition state ensemble characterized 
by the cross-correlation coefficient plotted against the change in the global order parameter AQ* 
. (Left and right) Typical examples of the native density profile for various values of overlap £1. In 
each example, the red line corresponds to the cooperative route, and, the black line corresponds 
to the non-cooperative route. The corresponding correlation coefficient between measured and 
predicted phi-value are lpin:0.77; llmb:0.84;lenh:0.87; 2ptl:0.0;limq:-0.1;lo6x:-0.2. 
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FIG. 6: (color) Free energy barrier /ksT{ plotted against the absolute contact order, ACO = 
V-^conX] I* — J I where the sum is over the A^" con native contacts pairs. Red points correspond to 
cooperative folding routes, and black points correspond to non-cooperative routes. Symbols have 
the same meaning as Fig. [2j For the cooperative routes, the correlation coefficient is r = 0.91 
with p-value p = 1.6 * 10 -11 ; for non-cooperative routes, the correlation coefficient if r = 0.41 with 
p- value p = 0.03. 
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Supporting Text 



In this supplement, we outline the variational model developed in ref.l and ref.2. 

Variational Model. A configuration of the protein is modeled by the N position vectors 
of the a carbons of the polypeptide backbone. Partially ordered ensembles of polymer 
configurations are described by a reference Hamiltonian corresponding to a harmonic chain 
inhomogeneously constrained to the native structure {rf} 

o ; 



3 



in 
oo 



2a 2 

The first term enforces polymeric constraints, with vf^ determined by the correlations of a 
freely rotating chairA The values of the harmonic constraints, {C}, control the magnitude of 
the fluctuations of each monomer about the native structure (i.e., the temperature factors). 
Ensembles of partially ordered configurations are represented by monomer densities rij(r) = 
pq: (5(r — i"j))o described as Gaussian distributions with variance Ga = (\5ri\ 2 ) /a 2 about the 
mean position of the ith monomer, Sj = ^GijCjrf . Here, the correlations Gij = {Sr^ ■ 
^ ', 5rj) /a 2 depend on both the polymeric and structural constraints through G" 1 = Tij + CiSij. 

The population of a partially ordered ensemble specified by the constraints {C} is 
controlled by the free energy F[{C}] = ^[{C}] — TSfjC}]. Here, T is the temperature, 
^[{C}] is the entropy loss due to localizing the residues around mean positions, and £?[{C}] 
is the energy associated with the partially ordered ensemble. The values of the variational 
constraints are determined by the critical points in the free energy surface. For each local 
minimum or saddlepoint, there corresponds to a set of N variational constraints that solve 
<9c*i-F[{C}] = 0. Transition states ensembles are identified as the saddle-points of F[{C}] 
that connect local minima (in a steepest descents sense). A folding route is the series of 
minimum-saddlepoint-minimum which connect the globule and native minima of _F[{C}]. 

O" 

Barrier Crossing Dynamics. The formation of local order along the folding route is 
characterized by the degree of localization about the native positions 

p l [{C}] = <exp[-3a N (r J -rf) 2 /2a 2 ]> . (2) 

We refer to pi as the native density. The prefactor, in turn, is determined by the growth 
rate of p;(i) along the unstable mode of the free energy. In the formalism developed in^, the 
growth rate is developed through the polymer dynamics of the constrained chain. Since the 
reference chain is harmonic, the correlation function between monomers i and j, Gij(t) = 
(rj(t) • rj(0))o, can be expressed as a sum over normal (Rouse) modes 

Gij{t ) = J2 %^ e -^* (3) 
where the coefficients Qi P and relaxation rates X p are determined by 

(r°. + c*<y q jp = x p q ip . (4) 
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Here, the monomer relaxation rate a = 3Do/a 2 is se t by the effective bond length a and 
monomer diffusion coefficient D$. 

The effective diffusion matrix corresponding to the native density dynamics, /%(t), can 
be defined through the Laplace transform of the correlation functions Cy(i) = (pi(t)pj(0)) o — 
(Pi)o(Pj)o- 

fi{uj) = C(0) • C- x {uj) ■ C(0) - wC(0). (5) 

Here, C(t) is determined by the polymer dynamics through the monomer correlations G(t). 
Since Pij(t) depends on both the structure of the constrained ensemble as well as time, this 
formalism gives a microscopic realization of the effective diffusion coefficient used in more 
general formulations of landscape theory^. 

Finally, the prefactor is given by k = \u\/2ii, where the growth rate to is the negative 
eigenvalue of 

p.ij{\u}\)Y jk u k = -\uj\u i:j (6) 

where = d 2 /3F[{C*}}/dpidpj is the curvature of the free energy with respect to the native 
density evaluated at the saddlepoint. 
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